Smart Paper Notes

DC-cycleGAN: Bidirectional CT-to-MR Synthesis from Unpaired Data

Jiayuan Wang , Q. M. Jonathan Wu , Farhad Pourpanah

Categories: Computer Vision | Machine Learning

2022-11-02

Magnetic resonance (MR) and computed tomography (CT) images are two typical types of medical images that provide mutually complementary information for accurate clinical diagnosis and treatment. However, obtaining both images may be limited by considerations such as cost, radiation dose, and missing modalities. Recently, medical image synthesis has attracted growing research interest as a way to cope with this limitation. In this paper, we propose a bidirectional learning model, denoted as dual contrast cycleGAN (DC-cycleGAN), to synthesize medical images from unpaired data. Specifically, a dual contrast loss is introduced into the discriminators to indirectly build constraints between real source images and synthetic images: samples from the source domain are used as negative samples, which forces the synthetic images to fall far away from the source domain. In addition, cross-entropy and the structural similarity index (SSIM) are integrated into DC-cycleGAN so that both the luminance and the structure of samples are considered when synthesizing images. The experimental results indicate that DC-cycleGAN produces promising results compared with other cycleGAN-based medical image synthesis methods such as cycleGAN, RegGAN, DualGAN, and NiceGAN. The code will be available at https://github.com/JiayuanWang-JW/DC-cycleGAN.
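The abstract above mentions SSIM as a way to account for luminance and structure. As a rough illustration only, the sketch below computes the standard SSIM formula over a single global window, using flat lists of pixel intensities. The function name `global_ssim`, the constants `k1 = 0.01` and `k2 = 0.03` (the conventional defaults), and the single-window simplification are assumptions made here; this is not the loss used in DC-cycleGAN, whose details are not given in the abstract.

```python
def _mean(values):
    """Arithmetic mean of an iterable of numbers."""
    values = list(values)
    return sum(values) / len(values)


def global_ssim(x, y, data_range=255.0, k1=0.01, k2=0.03):
    """SSIM over one global window for two equally sized flat pixel lists.

    Practical SSIM implementations average this quantity over many local
    (often Gaussian-weighted) windows; a single window keeps the sketch short.
    """
    if len(x) != len(y) or not x:
        raise ValueError("inputs must be non-empty and of equal length")

    # Small constants that stabilise the divisions.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2

    mu_x, mu_y = _mean(x), _mean(y)
    var_x = _mean((a - mu_x) ** 2 for a in x)
    var_y = _mean((b - mu_y) ** 2 for b in y)
    cov_xy = _mean((a - mu_x) * (b - mu_y) for a, b in zip(x, y))

    # Luminance term compares the means; the second term combines
    # contrast and structure through the variances and the covariance.
    luminance = (2 * mu_x * mu_y + c1) / (mu_x ** 2 + mu_y ** 2 + c1)
    contrast_structure = (2 * cov_xy + c2) / (var_x + var_y + c2)
    return luminance * contrast_structure


image = [0.0, 50.0, 100.0, 150.0]
same_score = global_ssim(image, image)            # identical inputs
reversed_score = global_ssim(image, image[::-1])  # anti-correlated inputs
```

Identical inputs score (numerically) 1, while the reversed image has a negative covariance with the original and therefore scores below 0. A training loss would typically use something like `1 - SSIM` computed on local windows.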

Translated by Google Translate

Deep Learning in Single-Cell Analysis

Dylan Molho , Jiayuan Ding , Zhaoheng Li , Hongzhi Wen , Wenzhuo Tang , Yixin Wang , Julian Venegas , Wei Jin , Renming Liu , Runze Su

Categories: Artificial Intelligence

2022-10-22

Single-cell technologies are revolutionizing the entire field of biology. The large volumes of data generated by single-cell technologies are high-dimensional, sparse, heterogeneous, and have complicated dependency structures, making analyses using conventional machine learning approaches challenging and impractical. In tackling these challenges, deep learning often demonstrates superior performance compared to traditional machine learning methods. In this work, we give a comprehensive survey on deep learning in single-cell analysis. We first introduce background on single-cell technologies and their development, as well as fundamental concepts of deep learning including the most popular deep architectures. We present an overview of the single-cell analytic pipeline pursued in research applications while noting divergences due to data sources or specific applications. We then review seven popular tasks spanning through different stages of the single-cell analysis pipeline, including multimodal integration, imputation, clustering, spatial domain identification, cell-type deconvolution, cell segmentation, and cell-type annotation. Under each task, we describe the most recent developments in classical and deep learning methods and discuss their advantages and disadvantages. Deep learning tools and benchmark datasets are also summarized for each task. Finally, we discuss the future directions and the most recent challenges. This survey will serve as a reference for biologists and computer scientists, encouraging collaborations.

Automatic Error Analysis for Document-level Information Extraction

Aliva Das , Xinya Du , Barry Wang , Kejian Shi , Jiayuan Gu , Thomas Porter , Claire Cardie

Categories: Natural Language Processing

2022-09-15

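Document-level information extraction (IE) tasks have recently begun to be revisited in earnest using the end-to-end neural network techniques that have been applied to their sentence-level IE counterparts. Evaluation of these approaches, however, has been limited in a number of dimensions. In particular, the precision/recall/F1 scores that are typically reported provide few insights into the range of errors the models make. Building on the work of Kummerfeld and Klein (2013), we propose a transformation-based framework for automating error analysis in document-level event and (N-ary) relation extraction. We use our framework to compare two state-of-the-art document-level template-filling approaches on datasets from three domains; then, to gauge progress in IE since its inception 30 years ago, we compare them against four systems from the MUC-4 (1992) evaluation.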

Translating a Visual LEGO Manual to a Machine-Executable Plan

Ruocheng Wang , Yunzhi Zhang , Jiayuan Mao , Chin-Yi Cheng , Jiajun Wu

Categories: Computer Vision | Artificial Intelligence

2022-07-25

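We study the problem of translating an image-based, step-by-step assembly manual created by human designers into machine-interpretable instructions. We formulate this problem as a sequential prediction task: at each step, our model reads the manual, locates the components to be added to the current shape, and infers their 3D poses. This task poses two challenges: establishing a 2D-3D correspondence between the manual image and the real 3D object, and estimating 3D poses for unseen 3D objects, since a new component added in one step can be an object built in previous steps. To address these two challenges, we present a novel learning-based framework, the Manual-to-Executable-Plan Network (MEPNet), which reconstructs the assembly steps from a sequence of manual images. The key idea is to integrate neural 2D keypoint detection modules with 2D-3D projection algorithms to achieve high-precision prediction and strong generalization to unseen components. MEPNet outperforms existing methods on three newly collected LEGO manual datasets and a Minecraft house dataset.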

Are Graph Neural Networks Really Helpful for Knowledge Graph Completion?

Juanhui Li , Harry Shomer , Jiayuan Ding , Yiqi Wang , Yao Ma , Neil Shah , Jiliang Tang , Dawei Yin

Categories: Artificial Intelligence | Machine Learning

2022-05-21

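Knowledge graphs (KGs) facilitate a wide variety of applications because they can store relational knowledge applicable to many areas. Despite great efforts invested in their creation and maintenance, even the largest KGs are far from complete. Hence, KG completion (KGC) has become one of the most crucial tasks in KG research. Recently, considerable literature in this space has centered on using graph neural networks (GNNs) to learn powerful embeddings that leverage the topological structure of KGs. Specifically, dedicated efforts have been made to extend GNNs, which are commonly designed for simple homogeneous and uni-relational graphs, to the KG setting by designing more complex aggregation schemes over neighboring nodes (crucial to GNN performance) that appropriately leverage multi-relational information. The success of these methods is naturally attributed to the use of GNNs over simpler multi-layer perceptron (MLP) models, owing to their additional aggregation functionality. In this work, we find that simple MLP models are able to achieve performance comparable to GNNs, suggesting that aggregation may not be as crucial as previously believed. With further exploration, we show that careful design of the scoring function and the loss function has a much stronger influence on KGC model performance, and that aggregation is not practically required. This suggests that prior work conflated scoring function design, loss function design, and aggregation. It also offers promising insights into the scalability of today's state-of-the-art KGC methods and calls for careful attention to aggregation designs better suited to KGC tasks in the future. The implementation is available online: https://github.com/juanhui28/are_mpnns_helpful.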
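To make the distinction between an encoder and a scoring function concrete, here is a minimal sketch of an aggregation-free pipeline: each entity embedding passes through a dense layer on its own (no neighbor information), and a triple is then scored. DistMult is used as the scoring function purely as a familiar example; the abstract does not say which scoring functions the authors study, and the names `mlp_encode` and `distmult_score`, the layer shape, and the toy numbers are all hypothetical.

```python
def mlp_encode(x, weight, bias):
    """One dense layer followed by ReLU, applied to a single embedding.

    No neighboring nodes are involved, which is what separates this
    encoder from a GNN layer that aggregates over the graph.
    """
    return [
        max(0.0, sum(w * v for w, v in zip(row, x)) + b)
        for row, b in zip(weight, bias)
    ]


def distmult_score(head, relation, tail):
    """DistMult score: sum over dimensions of head * relation * tail."""
    return sum(h * r * t for h, r, t in zip(head, relation, tail))


# Toy 2-dimensional embeddings (illustrative values only).
identity = [[1.0, 0.0], [0.0, 1.0]]
zero_bias = [0.0, 0.0]

head = mlp_encode([1.0, 2.0], identity, zero_bias)
tail = mlp_encode([3.0, 4.0], identity, zero_bias)
relation = [1.0, 1.0]

triple_score = distmult_score(head, relation, tail)  # 1*1*3 + 2*1*4 = 11.0
```

In a real model the scores of true triples would be pushed above those of corrupted triples by a loss function, which is the other design choice the abstract highlights.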

Graph Neural Networks for Multimodal Single-Cell Data Integration

Hongzhi Wen , Jiayuan Ding , Wei Jin , Yiqi Wang , Yuying Xie , Jiliang Tang

Categories: Machine Learning | Artificial Intelligence

2022-03-03

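Recent advances in multimodal single-cell technologies have enabled the simultaneous acquisition of multiple omics data from the same cell, providing deeper insights into cellular states and dynamics. However, it is challenging to learn joint representations from multimodal data, to model the relationships between modalities, and, more importantly, to incorporate the vast number of single-modality datasets into downstream analyses. To address these challenges and facilitate multimodal single-cell data analysis, three key tasks have been introduced: modality prediction, modality matching, and joint embedding. In this work, we present a general graph neural network framework, scMoGNN, to tackle these three tasks, and show that scMoGNN achieves superior results on all three compared with state-of-the-art and conventional approaches. Our method is an official winner in the overall ranking of modality prediction in the NeurIPS 2021 competition (https://openproblems.bio/neurips_2021/), and all implementations of our methods have been integrated into the DANCE package (https://github.com/omicsml/dance).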

TextRGNN: Residual Graph Neural Networks for Text Classification

Jiayuan Chen , Boyu Zhang , Yinfei Xu , Meng Wang

Categories: Natural Language Processing | Artificial Intelligence

2021-12-30

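Recently, text classification models based on graph neural networks (GNNs) have attracted increasing attention. Most of these models adopt a similar network paradigm: node embeddings initialized from pre-training, followed by a two-layer graph convolution. In this work, we propose TextRGNN, an improved GNN structure that introduces residual connections to deepen the convolutional network. Our structure obtains a wider node receptive field and effectively suppresses over-smoothing of node features. In addition, we integrate a probabilistic language model into the initialization of the graph node embeddings, so that non-graph semantic information can be better extracted. The experimental results show that our model is general and efficient. It significantly improves classification accuracy at both the corpus level and the text level, and achieves state-of-the-art (SOTA) performance on a wide range of text classification datasets.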
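The residual idea in the abstract above can be sketched with a generic graph convolution layer whose output is added back to its input, H_out = ReLU(A_norm H W) + H. This is a plain-Python illustration under assumed choices (symmetric normalization with self-loops, a square weight matrix so shapes match, and the hypothetical names `normalize_adjacency` and `residual_gcn_layer`); it is not the TextRGNN architecture itself, whose details the abstract does not give.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def normalize_adjacency(adj):
    """Symmetric normalization D^(-1/2) (A + I) D^(-1/2) with self-loops."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / (deg[i] * deg[j]) ** 0.5 for j in range(n)] for i in range(n)]


def residual_gcn_layer(a_norm, h, w):
    """One graph convolution with a skip connection: ReLU(A_norm H W) + H.

    W must be square so the convolved features can be added to the input.
    """
    z = matmul(matmul(a_norm, h), w)
    return [
        [max(0.0, z[i][j]) + h[i][j] for j in range(len(h[0]))]
        for i in range(len(h))
    ]


# Two connected nodes with one-hot features and an identity weight matrix.
adjacency = [[0.0, 1.0], [1.0, 0.0]]
features = [[1.0, 0.0], [0.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]

a_norm = normalize_adjacency(adjacency)
output = residual_gcn_layer(a_norm, features, identity)
```

Because the input is added back at every layer, each node keeps part of its own signal even as neighborhood averaging is repeated, which is the usual intuition for why residual connections help against over-smoothing when layers are stacked.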

Shared Coupling-bridge for Weakly Supervised Local Feature Learning

Jiayuan Sun , Jiewen Zhu , Luping Ji

Categories: Computer Vision

2022-12-14

Sparse local feature extraction is widely regarded as important in typical vision tasks such as simultaneous localization and mapping, image matching, and 3D reconstruction. At present, it still has deficiencies that need further improvement, mainly the discriminative power of the extracted local descriptors, the localization accuracy of the detected keypoints, and the efficiency of local feature learning. This paper focuses on improving the currently popular approach of sparse local feature learning with camera pose supervision. To that end, it proposes a Shared Coupling-bridge scheme with four lightweight yet effective improvements for weakly supervised local feature (SCFeat) learning. It mainly contains: i) the Feature-Fusion-ResUNet Backbone (F2R-Backbone) for local descriptor learning, ii) a shared coupling-bridge normalization to improve the decoupled training of the description network and the detection network, iii) an improved detection network with peakiness measurement to detect keypoints, and iv) the fundamental matrix error as a reward factor to further optimize feature detection training. Extensive experiments show that the SCFeat improvements are effective. SCFeat often obtains state-of-the-art performance on classic image matching and visual localization, and it still achieves competitive results in 3D reconstruction. Our source code is available at https://github.com/sunjiayuanro/SCFeat.git.

Multi-skill Mobile Manipulation for Object Rearrangement

Jiayuan Gu , Devendra Singh Chaplot , Hao Su , Jitendra Malik

Categories: Robotics | Machine Learning

2022-09-06

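We study a modular approach to long-horizon mobile manipulation tasks for object rearrangement, which decomposes a full task into a sequence of subtasks. To tackle the entire task, prior work chains multiple stationary manipulation skills with a point-goal navigation skill, each learned individually on its subtask. Although more effective than monolithic end-to-end RL policies, this framework suffers from compounding errors in skill chaining, for example navigating to a bad location from which a stationary manipulation skill cannot reach its target. To this end, we propose that manipulation skills should include mobility, so that they can interact with the target object from multiple locations, and that the navigation skill could have multiple end points that lead to successful manipulation. We operationalize these ideas by implementing mobile manipulation skills rather than stationary ones, and by training a navigation skill with region goals instead of point goals. We evaluate our multi-skill mobile manipulation method, M3, on three challenging long-horizon mobile manipulation tasks in the Home Assistant Benchmark (HAB), and show superior performance compared with the baselines.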

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Peng Ye , Baopu Li , Tao Chen , Jiayuan Fan , Zhen Mei , Chen Lin , Chongyan Zuo , Qinghua Chi , Wanli Ouyan

Categories: Computer Vision | Artificial Intelligence

2022-08-10

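Semantic segmentation is a popular research topic in computer vision, and many efforts have been devoted to it with impressive results. In this paper, we aim to search for an optimal network structure that can run in real time for this problem. Toward this goal, we jointly search the depth, channel, dilation rate, and feature spatial resolution, which results in a search space of about 2.78*10^324 possible choices. To handle such a large search space, we leverage differentiable architecture search methods. However, the architecture parameters found by existing differentiable methods need to be discretized, which causes a discretization gap between the architecture parameters found by the search and their discretized version used as the final solution. Hence, we alleviate the discretization gap from the new perspective of solution space regularization. Specifically, a novel Solution Space Regularization (SSR) loss is first proposed to effectively encourage the supernet to converge to its discrete counterpart. Then, a new hierarchical and progressive solution space shrinking method is presented to further improve search efficiency. In addition, we theoretically show that optimizing the SSR loss is equivalent to L_0-norm regularization, which accounts for the improved search-evaluation gap. Comprehensive experiments show that the proposed search scheme can efficiently find an optimal network structure that delivers very fast segmentation (175 FPS) with a small model size (1 M) while maintaining comparable accuracy.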
